Distributed Large-Scale Information Filtering
نویسندگان
چکیده
We study the problem of distributed resource sharing in peer-to-peer networks and focus on the problem of information filtering. In our setting, subscriptions and publications are specified using an expressive attribute-value representation that supports both the Boolean and Vector Space models. We use an extension of the distributed hash table Chord to organise the nodes and store user subscriptions, and utilise efficient publication protocols that keep the network traffic and latency low at filtering time. To verify our approach, we evaluate the proposed protocols experimentally using thousands of nodes, millions of user subscriptions, and two different real-life corpora. We also study three important facets of the load-balancing problem in such a scenario and present a novel algorithm that manages to distribute the load evenly among the nodes. Our results show that the designed protocols are scalable and efficient: they achieve expressive information filtering functionality with low message traffic and latency.
منابع مشابه
PERFORMANCE EVALUATION OF ROUTE-BASED DISTRIBUTED PACKET FILTERING FOR DDOS PREVENTION IN LARGE-SCALE NETWORKS A Thesis
Kim, HyoJeong. M.S., Purdue University, December, 2003. Performance Evaluation of Route-based Distributed Packet Filtering for DDoS Prevention in Large-scale Networks. Major Professor: Kihong Park. This thesis studies performance evaluation of route-based distributed packet filtering (DPF) for spoofed distributed denial of service (DDoS) attack prevention in large-scale networks under dynamic n...
متن کاملDistributed piecewise filtering design for large-scale networked nonlinear systems
This paper investigates the problem of distributed piecewiseH∞ filtering for discrete-time large-scale nonlinear systems. The considered large-scale system is composed of a number of nonlinear subsystems and exchanges its information through communication network. Each nonlinear subsystem is described by a Takagi-Sugeno (T-S) model, and data-packet dropouts happen intermittently in communicatio...
متن کاملDistributed multi-agent Load Frequency Control for a Large-scale Power System Optimized by Grey Wolf Optimizer
This paper aims to design an optimal distributed multi-agent controller for load frequency control and optimal power flow purposes. The controller parameters are optimized using Grey Wolf Optimization (GWO) algorithm. The designed optimal distributed controller is employed for load frequency control in the IEEE 30-bus test system with six generators. The controller of each generator is consider...
متن کاملHierarchical Filtering-based Monitoring System for Large-scale Distributed Applications
On-line monitoring of large-scale distributed (LSD) applications is an eeective means to observe the appli-cations' behavior at run-time and provide status information required by debugging and management tools. In this paper, we describe and motivate the architecture and the components design of a scalable, high-performance, dynamic and non-intrusive monitoring system for LSD applications. The...
متن کاملUsing WordNet as a Knowledge Base for Measuring Semantic Similarity between Words
In this paper we propose the use of WordNet as a knowledge base in an information retrieval task. The application areas range from information filtering and document retrieval to multimedia retrieval and data sharing in large scale distributed database systems. The WordNet derived knowledge base makes semantic knowledge available which can be used in overcoming many problems associated with the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Trans. Large-Scale Data- and Knowledge-Centered Systems
دوره 13 شماره
صفحات -
تاریخ انتشار 2014